Time-compressing natural and synthetic speech
نویسنده
چکیده
Phoneme detection is a useful tool to compare the perception of perfectly intelligible speech types. As previous research suggests that perception of fast speech is helped by segmental redundancy, we expected the hyperarticulation of synthetic speech to turn into an advantage at a fast rate. Consequently, the processing advantage of natural over synthetic speech was expected to decrease after time-compression. Secondly, detection times were expected to be slower after moderate timecompression because of the higher processing difficulty of fast speech. However, detection times tended to become shorter in the time-compressed condition. This was attributed to shorter durations of syllables and words. Furthermore, the processing advantage of natural over synthetic speech did not decrease, but rather tended to increase. This may be explained by the lack of a speaking effort pattern in synthetic diphone speech, which makes it rather blurred at faster playback rates.
منابع مشابه
Recognition of time-compressed speech does not predict recognition of natural fast-rate speech by older listeners.
This study investigated whether recognition of time-compressed speech predicts recognition of natural fast-rate speech, and whether this relationship is influenced by listener age. High and low context sentences were presented to younger and older normal-hearing adults at a normal speech rate, naturally fast speech rate, and fast rate implemented by time compressing the normal-rate sentences. R...
متن کاملThe Temporal Delay Hypothesis: Natural, Vocoded and Synthetic Speech
Including disfluencies in synthetic speech is being explored as a way of making synthetic speech sound more natural and conversational. How to measure whether the resulting speech is actually more natural, however, is not straightforward. Conventional approaches to synthetic speech evaluation fall short as a listener is either primed to prefer stimuli with filled pauses or, when they aren’t pri...
متن کاملThe effect of filled pauses and speaking rate on speech comprehension in natural, vocoded and synthetic speech
It has been shown that in natural speech filled pauses can be beneficial to a listener. In this paper, we attempt to discover whether listeners react in a similar way to filled pauses in synthetic and vocoded speech compared to natural speech. We present two experiments focusing on reaction time to a target word. In the first, we replicate earlier work in natural speech, namely that listeners r...
متن کاملVoice Onset Time and the Perception of Japanese Voicing Contrasts
Much crosslinguistic research exists on the production and perception of voice onset time (VOT). However, most research on the perception of VOT uses synthetic stimuli instead of natural speech stimuli. Effects of synthetic speech on the perception of VOT are not known, but more research needs to be done to see if there are differences between perception using synthetic speech and perception us...
متن کاملPerception of Speech Rate and Naturalness in Synthetic Slow Speech
This paper details two perception experiments based on synthetic British English obtained with CART models predicting phone durations in slow speech from normal speed speech. Speech rate and naturalness were assessed by 6 English natives. Synthetic slow speech was rated as both slower and less natural than natural slow speech; however, the insertion of the pauses produced in natural slow speech...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002